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Molecular Modeling 


Drug design is an iterative process which begins when a chemist identifies a 
compound that displays an interesting biological profile and ends when both the 
activity profile and the chemical synthesis of the new chemical entity are 


optimized. 


Drug design has two major classes: 
1. Indirect drug design "ligand-based": 


This relies on knowledge of other molecules that bind to the biological target of 
interest. These other molecules may be used to derive 'a pharmacophore’ model 
that defines the minimum necessary structural characteristics a molecule must 
possess in order to bind to the target. In other words, a model of the biological 
target may be built based on the knowledge of what binds to it and this model in 
turn may be based on the knowledge of what binds to it, and this model in turn 


may be used to design new molecular entities that interact with the target. 


2. Direct drug design "Structure-based"': 


Which relies on knowledge of the three dimensional structure of the biological 
target obtained through methods such as x-ray crystallography or NMR 


spectroscopy. If an experimental structure of a target is not available it may be 
possible experimental structure of a target is not available, it may be possible to 
create a homology model of the target based on the experimental structure of a 
related protein. Using the structure of the biological target, candidate drugs that 
are predicted to bind with high affinity and selectivity to the target may be 
designed using interactive graphics and the intuition of a medicinal chemist or 


various automated computational procedures to suggest new drug candidates. 
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Drug Design Approaches 
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Molecular representations 


The starting point for many computer assisted molecular design (CAMD) studies 
is generally a two dimensional drawing of a compound of interest. These 
diagrams can range from notebook or "back-of-the-envelope" sketches to 
electronically stored connection tables in which one defines the types of atoms in 
the molecule, their hybridization and how they are bonded to each other. Carbon 
dioxide, for example, would be defined as one SP2 oxygen atom (atom number 1) 
bonded to an SP carbon atom (atom number 2) with a double bond which in turn, 
is bonded to a second SP2 oxygen atom with a double bond. The connection table 


which defines carbon dioxide might be written as shown down. 


atom # Atom Name Atom Type Bound to Atoms 





Figure 1. Connection Table for Carbon Dioxide 


Connection tables are easily stored and searched electronically. However, they 
must be transformed into three dimensional representations of chemical structure 
to study chemical properties. Chemists use the mathematical descriptions of the 
rules of physical chemistry which are contained in quantum mechanics and 


molecular mechanics to accomplish this task. 


Aim: we introduce some of the methods used for the computational representation 


of molecular structures and the creation of structural databases. 
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A. 2D structure representations 
1. The connection Table 


Name of constant Value Description 


btAromatic 
btDouble double bond 


btNoBond | 0 | bond undefined 
btSingle single bond 
btTriple triple bond 


indek Atomic Atom Connected bond 
number type to atom 





Figure2. The connection table for aspirin in the MDL format (hydrogen-suppressed form). The 
numbering of the atoms is as shown in the chemical diagram. 


- Build the table for the following compounds: 
Aspirin 
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cry 


2. SMILES STRING 





Acetanilide 


A SMILES string is composed of atom specifications arranged sequentially 
according to connectivity in the molecular graph. 


Example 


O(C(=O)C)cleccec1C(O)=O 


hy CY O 


- Draw structure ---> Edit ----> Copy as (2D MDL, SMILES) 


B. 3D structure representations 
- Types of modes : Point/ Line /stick / Ball and line / Ball and stick/Space filling 
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Draw structure ---> Mode -> what ---> Edit > Copy as (3D MDL, export images as 
BMP) 
Record the xyz coordinates of each atom and try to change it 
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Energy minimization 


Aim: The Energy Minimize application in MOE calculates atomic coordinates that 


are local minima of a molecular energy function. Energy minimization is useful 


for molecule building, determining low energy conformations, conformational 


search and preparation for molecular dynamics simulations and vibrational 


analysis 


Insummary: Energy minimization can be used to find a reasonable and energetically preferred 3D 


conformation of the molecule. 


Steps: 


a. 


b. 


Summarize in your own words how energy minimization basically works. 


Open the SVL window to monitor the following computation steps. Measure 
the initial potential energy of your ligand. 


E(x) S Estr + Eang + E stb + E oop + Etor + Evaw ar Eele + Eso) + Eres 


Er is the bond stretch energy 

Eng IS the bond angle bend energy 

Exp is the stretch-bend energy 

Esop is the out-of-plane energy 

Eor is the dihedral angle, or torsion energy 
Evaw is the van der Waals energy 

Esie is the electrostatics energy 

E.o is the solvation energy 


Ees is the restraint energy 


Perform an energy minimization on your ligand. How does the conformation 
of the molecule change? What is the potential energy of the minimized structure? 
Which terms show the most significant changes? Can you give reasons for 


that? Save the minimized structure as moe file. 
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How to apply: 


- Compute >>> potential energy ----- record the value of the total energy of the ligand 
- Compute >>> energy minimize 


- Compute >>> potential energy ----- record the value of the NEW total energy of the ligand 
Penicillin G 
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Identification and assay of penicillin 





Ampicillin 


OA 


Na 


j 
1. Functional group analysis: - 
COOH, NH» ,C=0 


a. Reaction with Na” CO; 
b. Reaction with FeCl, 


2: 


eff. with evolving of CO». 
Colored solution or precipitate 


Spectroscopic identification of the drug 





FEB Rag FEB 
3500 3000 2500 nC 
IR. Amoxicillin trihydrate+ Gum karaya 
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H-6 
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HNMR 


H-5 
4 = 8.5 Hz 





3. Assay of the drug: using acid-base titrations 





Ampicillin solution in acetonitrile-water mixture is titrated with standard sodium hydroxide 
solution. The end point is determined by pH measurement or using of Phenolphthalein as 
indicator. 


Diye eo We Ahi 


LP os One® 

Principle: Ampicillin is semi synthetic penicillin, which is used as a broad-spectrum 
antibiotic. The principle involved in assay is acidimetric titration. Ampicillin is stable to 
acids & being acidic in nature it can be assayed directly by titrating with base. It is a 
monobasic acid, which consists of one carboxylic acid, and therefore it consumes Imole of 
NaOH solution. Thus, it can be estimated by no. of moles of NaOH consumed in the 
titration. The titration is done with NaOH after dissolving the sample in the free water 


dissolving the sample in free water (freshly boiled and cooled). 
Procedure: 


e Weigh accurately quantity of powder equivalent to 0.5g of Ampicillin. 


e Transfer the powder into a conical flask containing 25ml of freshly boiled and cooled 
distilled water. 


e This solution aside for 10min and then add phenolphthalein indicator. Titrate the solution 
with 0.1 N NaOH until permanent pink color is obtained. 
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e Equivalent Factor: 1 ml of 0.1 N NaOH = 0.03495 gm of Ampicillin. 

e Report: The amount of ampicillin present in the given sample was found to be ------------ 
e % purity was found to be---------------- l 

Test yourself: 

If the mls of titrant = 14 ml 

Ampicillin weight in gm = 


% purity = 

Database creation 
This chapter explains the basic steps involved in creating and editing databases containing 
numeric, text and molecular data entries. If you are looking for information on a specific menu 
or command, click on the links in the Database Viewer Menu Bar. 


The MOE interface provides several ways of creating databases: 


e Inthe MOE Window (and also in the Sequence Editor), choose File | New | Database. 
e Inthe Database Viewer, choose DBV | File | New. 


The following panel appears: 








s/ New Database -Ioj x] 
$ yy TY work” mdb (3). cwo 
Name Type Size Date — 

= 
4 L 
| vo *mdb 
Fields 


Type: molecule * Name: mol 
Type: float * Name: 
Type: char 7 Name: 


OK Cancel 
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In the New Database panel, specify the path and name of the new database. The lower portion 
of the panel allows you to create three fields in the new database. If the name fields are left 
blank then no fields will be created. By default, a single molecule field is created with the field 
name of mol. 


Press OK to create the new database and open it in a Database Viewer. 


Creating Fields 
Fields hold numeric, character, or molecular data. Operations on fields are done via the Edit menu or 
the Field Popup menu (position the cursor over the field name and press the mouse popup button). 


1. Inthe Database Viewer, choose DBV | Edit | New | Field. You can also use the Field Popup 
menu: DBV | Field Popup | New. 


The CLI prompt area will display an option menu with the available database data types 
and a text field for entering the new field name. Non-molecule fields (i.e. numeric or 
character) can also take a value. The value will be replicated in the new field across all 


entries. 


* New Field Type: int * Name: | Value: OK Apply Cancel 


2. Select the type of field to create: 


Line Numeric Integer value or vector of integers 

Poat Numeric Real number or vector of real numbers (4-byte IEEE floating point data) 
Char Text String of characters 

byte Numeric 8-bit numerical value ranging from -128 to 127 inclusive 


molecule Molecular MOE molecular format (see mol Extract) 


double Numeric Real number or vector of real numbers (8-byte IEEE floating point data) 


3. Enter the name of the new field and, if desired, a value used to initialize the field. Press 
Return to create the field. The field names appear as column labels in the top row of the 
database. You can also use the Apply button to create the field. Using the OK button will create 
the field and dismiss the prompter. 

4. Repeat this procedure to create as many fields as required by the database you are creating. 
Once this is done, you can add entries to the database, as described below. 


The Field Popup | Duplicate command can also be used to create a new field. In this case, an 
exact copy of the field is created. You will be asked to confirm the field creation. The name of 
the new field is automatically generated from the original field name by prefixing it with 'Copy 
of '. Ifa field of this name already exists, the new field will be numbered (e.g. 'Copy (2) 
of...'). The new field is positioned immediately to the right of the original field. 
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Multiple fields can be duplicated using the Field Popup | Selection | Duplicate command. The 
copied fields are appended, 1.e. are positioned as the rightmost columns of the database. 


Adding Entries 


Entries are commonly added to a database automatically through many of MOE's applications. 
However, it is often required to add entries manually to a database in order to store data 
currently viewed in the MOE Window. This section describes how to add entries to a database 
and how to filter the system in order to copy only parts of the system into the new database 
cells. 


To open the Add Entry panel use either of the following commands: 


e DBV | Edit | New | Entry 
e DBV | Entry Popup | New 


A panel similar to the following appears: 


ef Add Entry -|0| x| 


Database: c:/work/bb.mdb 








Molecule: mol ¥ Visibility: Ignore 7 
Mode: System T Selection: lgnore 7 
Sequence Only 
activity: | 
name: 





The panel consists of two sections: the Molecular Field and the Non-Molecular Fields. If 
there are no fields of the specified type then that section will not be displayed. 


To add a single new entry with no data initialization, set the Mode option to None and press 
OK. To add a single new entry with the entire system copied into the specified molecule field, 
simply press OK. 
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QSAR Tutorial 


This tutorial has been broken down into the following steps: 


l. Calculating Molecular Descriptors 
II. Fitting the Experimental Data 
II. Cross-Validating the Model 
IV. Performing Graphical Analysis 
V. Estimating the Predicted Activities for the Test Set 


VI. Pruning the Descriptors 
VII. References 


The Quantitative Structure-Activity Relationship (QSAR) suite of applications in MOE is used 
to analyze experimental data and build numerical models of the data for prediction and 
interpretation purposes. Given a set of molecules whose activity in a particular experiment is 
known (referred to as a training set or learning set), a QSAR model correlates these activities 
with properties inherent to each molecule in the set. These properties are evaluated using 
molecular descriptors available in MOE. 


The particular example chosen for this exercise is the Selectivity Differences of Inhibitors 
Binding to Trypsin [Bohm, 1999; Stiirzebecher, 1991, 1993, 1995]. The entire training data set 
consists of 72 molecules derived from 3-amidinophenylalanine and the test set has 16 molecules 
similarly derived. The molecules in the training set are aromatic and polar, and properties such 
as molar refractivity and the logarithm of the (octanol/water) partition coefficients (logP) are 
important in describing such systems. 


Molecular descriptors are calculated using QuaSAR-Descriptor. 
In this tutorial you will learn how to: 


1. Choose appropriate molecular descriptors, evaluate them for each molecule in the training set 
and store them in a database. 

Fit the experimental activity to these descriptors using a linear model. 

Cross-validate the fit and perform graphical analysis. 

Estimate the activities of a set of related molecules known as the test set. 

Prune a set of descriptors to obtain a more relevant descriptor set using QuaSAR-Contingency. 


web rebated 


Conventions Used in This Tutorial 
MOE MOE Window 
SE Sequence Editor 


DBV Database Viewer 


This tutorial assumes that you are using a three-button mouse. For information on using a two- 
button mouse see Using the Mouse. 
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I. Calculating Molecular Descriptors 
1. Open the database that contains the training data set with experimental activities: 
MOE | File | Open | SMOE/sample/mol/trpstrain.mdb 


2. Because this database is read-only, you must save it in your own working directory in order to run 
calculations on the data: 


DBV | File | Save As | your local directory/mytrpstrain.mdb 


The training set consists of 71 molecules derived from 3-amidinophenylalanine. It 
contains the structure and activity (pK;) of each molecule. The index field is the 
molecule number given in the original paper. 


To get a better view of the molecules in the database, position the cursor over any one of 
the cells in the first column (called molecule), press the left mouse button and drag 
down (and slightly to the right) until you clearly see all the molecular drawings. You can 
rotate each drawing by dragging the middle mouse button over it. 





ef Database Viewer : $DESKTOP/trpstrain.mdb -Ioj x] 
File Edit Display Compute Window Help SVL DBV MOE 1 


— | molecule | index | _pki-trypsin 





A - 


4 > 























71 entries, 0 selected, all visible. 3 fields, 0 selected, all visible. AA 





3. The next step in a QSAR analysis is to choose suitable molecular descriptors: 
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DBV | Compute | Descriptors | Calculate 


The Calculate Descriptor panel includes the list of all the descriptors available in MOE: 






































ef Calculate Descriptors -Ioj x] 
Database File: s:/mytrpstrain.mdb Selected Entries Only 
Molecule Field: molecule ¥ 
Auto Select Database Fields Selected Database Fields Clear Selection 

Descriptors Selected: 0 

Code Class Description à 

AMI dipole i3D Dipole moment =| 

AMI E 30 Total energy (kcal/mol) 

AMI Eele 30 Electronic energy (kcal/mol) 

AMI HF i3D Heat of formation (kcal) 

AMI HOMO 30 HOMO energy (eV) 

AMI IP 30 lonization potential (kcal/mol) 

AMI LUMO 30 LUMO energy (eV) 

apol 2D sum of atomic polarizabilities 

ASA i3D Water accessible surface area 

ASA+ 3D Positive accessible surface area 

ÀSA- i3D Negative accessible surface area y 

4 > 
Class: All 2D Protein i3D x30 

Filter: | Apply Clear 





The following 18 descriptors will be used: 


SLOGIE Wise SMR VSAO 
SLOGIe Voal oMR VSA1 
SLOG IS WS SMR VSAZ 
SILOG IE WV SINS SMR VSA3 
SlogP VSA4 SMR VSA4 
SLOG WSIS SMR VSA5 
SlogP VSA6 SMR VSA6 
Slog)? Woe SMR _VSA7 
SILOGIE ona 

SONG WISIN, 


An easy way to select the descriptors is to use the Filter at the bottom of the panel: 


a. To select the first column of descriptors, enter s1 in the Filter text field. Only descriptors with the 
characters s1 are listed. Click on SlogP_VSAO, press Shift and click on SlogP_ VSA9. Notice that 
the Descriptors Selected count now indicates that 10 descriptors are selected. 
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s/ Calculate Descriptors -Ioj x| 
Database File; s:/mytrpstrain.mdb Selected Entries Only 
Molecule Field: molecule ¥ 

Database Fields selected Database Fields Clear Selection 


















Code Class Description á 
GCUT SLOGP 1 2D LogP GCUT (1/3) 

GCUT_SLOGP_? 2D LogP GCUT (2/3) 

GCUT_SLOGP_3 2D LogP GCUT (3/3) 

slogP 2D Log Octanol/Water Partition Coefficient 


[a | 


SlogP_VSA0 i Bin 0 SlogP (-10 -0.40] 

SlogP_V ; Bin 1 SlogP (-0.40,-0.20] 

SlogP V : Bin 2 SlogP (-0.20, 0.00) 

Bin 3 SlogP { 0.00, 0.70] | 
Bin 4 SlogP ( 0.10, 0.15] | 
Bin 5 SlogP (0.15, 0.20] 

Bin 6 SlogP (0.20, 0.25] 

SlogP_VSA Bin 7 SlogP (0.25, 0.30] 

SlogP_V Bin 8 SlogP ( 0.30, 0.40] 

SlogP_VSA9 Bin 9 SlogP ( 0.40,10] 


hi 


Aa 


Jom tn p tu 





20 Protein 13D ¥3D 






Apply, Clear 






b. Now enter sm in the Filter text field. Only descriptors with these characters appear. Select the 
series of SMR_VSA descriptors using Shift (the descriptors selected previously remain selected 
even though they are not currently visible in the panel). The panel should report that 18 
descriptors have been selected. 


A brief description appears beside each descriptor. For more information, see QuaSAR- 
Descriptors. 


4. Press OK once the set has been selected. MOE will evaluate each descriptor for every molecule and 
store the values in mytrpstrain.mdb. New fields are named based on the descriptor codes. 

5. Now suppose you need to add a new molecule (e.g. trypsin72 which has a pK; of 3.0) to the 
database and would like to calculate its descriptors as well. Fortunately, it is not necessary to re- 
select and re-evaluate all the descriptors in the Calculate Descriptors panel as was done previously. 
Instead, you can tell MOE to evaluate the descriptors for the new database entry only. 

a. Open the molecule into MOE: 


MOE | File | Open | SMOE/sample/mol/trypsin72.moe 
b. Add the new molecule to mytrypstrain.mdb: 
DBV | Edit | New | Entry 
The Add Database Entry panel lists all the fields in the database. Next to the 
molecule field are buttons specifying which atoms to copy from MOE. All other 


database fields have text fields where you can enter values. To add the molecule 
to the database: 
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= Enter its index number: 72. 
= Enter its pki-trypsin value: 3.0. 
= Press OK. The molecule is appended to the database, becoming entry number 72. 
= Remove the molecule from the MOE system using MOE | RHS | Close. 
c. Before calculating the descriptors for this molecule, you must first select the new entry in the 
database. To do so, simply click on the entry number, i.e. number 72 in the left-most column. 
d. Reopen the Calculate Descriptor panel: 


DBV | Compute | Descriptors | Calculate 
In the Calculate Descriptor panel: 


= Turn on the Selected Entries Only option. 
= Click on Auto Select: Database Fields. This automatically reselects (in the Calculate Descriptor 


panel) all of the descriptors that were previously calculated in the database. Notice that the line 


above the list states: Descriptors Selected: 18. 





[ioj x| 


ef Calculate Descriptors = | 
«l Selected Entries On ly 








Database File: s:/mytrpstrain.mdb 
Molecule Field: molecule 
Auto Sele ataba: ield Selected Database Fields Clear Selection 









Descriptors Selected: 1¢ 
Code Class Description á 
AM1_dipole i3D Dipole moment 
AMA_E i30 Total energy (kcal/mol) 
AMI Eele i3D Electronic energy (Kcal/mal) 
AM1_HF i3D Heat of formation (kcal) 
AMA_HOMO i30 HOMO energy (eV) 
AMI IP i3D lonization potential (kcal/mol) 
AMI LUMO i3D LUMO energy (eV) 
apol 20 Sum of atomic polarizabilities 





= Ensure that entry 72 is the only selected entry in the database and click OK to calculate the values 


for the descriptors of the new molecule. Then deselect entry 72: 


DBV | Edit | Clear | Entry Selection 


II. Fitting the Experimental Data 


The next step is to fit the dependent variable, pki-trypsin, to the independent variables, 
namely the molecular descriptors, in the database. 


1. Open the QuaSAR-Model panel: 
DBV | Compute | Model | QSAR 


2. Inthe panel: 
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»/ QuaSAR-Model -|5) x| 
Database: s:/mytrpstrain.mdb Selected Entries Only 





Status: PLS Model: RMSE=@.486826 R2=6.689020 
Method: PLS ¥ 





pki-trypsin ¥ 


Weight Field: (NONE) T 
Options: Component Limit: el Condition Limit 1e+886 
Binary Threshold: @ smooth: @.25 
Model: Desc riptor Coefficient Normalized 4 
index 


pki-trypsin 
SlogP VSA8 
SlogP VSAl 
SlogP VSA2 
SlogP VSA3 
SlogP VSA 
SlogP VSA5 
SlogP VSA 7 
SlogP VSA? 1, 02546 1.7962 
| 


al 


Ga P 

Pa 0 
J 

D G 

D fa 


D ua D 
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Di ua 
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T= 
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= 
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sort by Normalized sort by Name Select Database Fields 
Fit Report Validate Save Close 





a. As Activity Field, select: pki-trypsin. 
b. Choose all the descriptors except for pki-trypsin and index. 


Tip To select a range of descriptors in the list, click on the first item, 
SlogP VSAO, scroll to the end of the list and press Shift as you click on the last 
item, SMR_VSA7. This selects the entire range of descriptors. 

c. Press Fit to perform the regression analysis. The root mean square error (RMSE) and r” 
(R2) values of the fit appear at the top of the QuaSAR-Model panel, in the Status field. 

d. Press Report to view a report on the calculations in the SVL Text Editor window. You 
can save or discard the text file before closing the SVL Text Editor. 

e. Press Save in the QuaSAR-Model panel, and save the model as trypsinpred. fit. 
This model will be used to predict the activities of the candidates in the test data set. 


Do not close the QuaSAR-Model panel yet. 
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III. Cross-Validating the Model 
Having performed the QSAR fit, it is now useful to validate the model. The validation 
procedure evaluates the predicted activities and the residuals for the molecules in the training 


set. In cross-validation, the model is tested using a LOO (Leave One Out) method. 


1. Click on Validate in the QuaSAR-Model panel. The QuaSAR Fit Validation panel appears: 


»/ QuaSAR Fit Validation ] 10) x! 





Database: s:/mytrpstrain.mdb 
Model Validation: T Write Predictions 
/ Write Prediction Residuals 


~ Write Prediction Z-Scores 
Cross Validation: / Write P 

J Write Prediction Residuals 

J Write P 


redictions 














réediction #-Scores 


OK Cancel 








2. Select all the Model Validation and Cross Validation properties and press OK. (You can now 
close the QuaSAR-Model panel.) 


$PRED, $RES and $Z-SCORE fields are created in mytrpstrain.mdb. They contain the 
predicted, residual and Z-score values respectively. The corresponding cross-validation 
properties, $XPRED, $XRES and $XZ-SCORE, are also calculated. (More information is 


available in QuaSAR-Model.) 

IV. Performing Graphical Analysis 

A $Z-SCORE value of 2.5 and above can be considered to indicate molecules which are outliers 
to the fit. These might require new descriptors. You can graphically display which molecules 
are outliers in the database using MOE's plotting facilities. 


1. To display the plot area in the bottom half of the Database Viewer: 


Toggle on DBV | $Z-SCORE Field Popup | Plot or the Plot icon in the database viewer 
footer. 


2. You can plot the SZ-SCORE values using the Plot Fields drop-down menu: 
Plot Fields | $Z-SCORE 
3. Click the Fit button in the database viewer footer. This adjusts the display so that all data 


points are visible in the plot area. Each dot represents an entry (in other words, a molecule in 
the database). 
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As you can see, two dots exceed the 2.5 mark. (Since there are only two outliers in this 
example, the choice of descriptors can be considered adequate.) Position the cursor over 
a dot without moving for a second or so. The number of the entry will appear in yellow 
bubble help. Select the dots by clicking left in the vertical ruler at 2.5, and dragging 
upward. This causes a selection box to appear. Releasing the mouse causes all points in 
the selection box to become selected. (You can also drag a selection box in the plot area 
itself.) 


Another type of plot used to evaluate the data results is the XY plot. By plotting the actual 
activities (X-axis) versus the predicted activities (Y-axis), the predictive ability of the model can 
be assessed. To show the linear relationship between actual and predicted activities, open the 
correlation plot panel: 

DBV | Compute | Analysis | Correlation Plot 


and select: 


a. Selecta field to plot in the X-axis: choose pKi-trypsin 
b. Selecta field to plot in the Y-axis: choose SPRED 


The correlation plot will be generated as shown below: 


lo] x! 





Correlation Plot cwork/trpstrain.mdb 
R=0.8301 R2=0.6890 ($PRED) = 0.68902 (pki-trypsin) + 1.85301 


pki-trypsin 





Attributes... Data to Clipboard Clear Selection Hide Selection Close 





As shown in the above correlation plot, a linear relationship exists between the actual 
and predicted activities. The correlation coefficient is displayed in the top of the plot. To 
further establish the reliability of the QSAR model, apply the model against an external 
test set to evaluate the predictive ability. 
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V. Estimating the Predicted Activities for the Test Set 


Using the predicted fit trypsinpred. fit obtained above, you can now evaluate the predicted 
pK; values of the test set of molecules. The test set includes 16 molecules derived from 3- 
amidinophenylalanine. 


1. DBV | File | Open | SMOE/sample/mol/trpstest.mdb 
2. Because this database is read-only, you must save it in your own working directory in order to 
run calculations on the data: 


DBV | File | Save As | your local directory/mytrpstest.mdb 


3. DBV | Compute | Model | Evaluate 





ef Model-Evaluate | -Ioj x] 
Database: s:/mytrpstest.mdb Browse... 
Model File: |S: /trypsinpred. fit Browse... 


Field: thypsin-pred 
Entries: All Selected Visible 


~ Use Descri ptors in Database if Available 


OK Cancel 





a. Enter the name of the Model File: trypsinpred. fit. 
b. Change the name of the database Field to: trypsin-pred. Predicted values will be 
written to this field. 
c. Press OK to evaluate the predicted values. 
4. The residuals between the experimental pK; values (oki-trypsin) for the entries in the test 
set and the predicted values (trypsin-pred) can be calculated using the Molecular Database 
Calculator: 


DBV | Compute | Calculator 


a. Inthe Available Fields area, double-click on pki-trypsin. The selected field appears in 
the equation area on the left of the panel. 

Click on the minus sign button. 

Double-click on the tryps in-pred field to insert it in the equation. 

d. As Destination Field, change the default name to: trypsin-res. 


O Oo 
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s/ Molecular Database Calculator -Ioj x| 
Database: s:/mytrpstest.mdb 
Use Selected Entries Only 
<= |Del Available Fields: 
tpki-trypsin} - {trypsin-pred} & index A 


|| pki-trypsin 
| trypsin-pred 











Y T 
4 el | eee o |> ai | 
pow log sar abs sin cos tan ceil 
exp logi0 sort cbrt asin acos atan floor 
PI e index selected RAND { ) USER 
< > max min f 3 5 + 
<— Sx mean std_dev 4 5 6 - 
== => fquart tquart 2 3 in 
and or median sum 0 . +- j 
USER Function: 
Destination Field: trypsin-res| 7 
Destination Type: float ¥ 
Evaluate Clear Close 





e. Press Evaluate. The results, i.e. the residuals for the test entries, are written in a new 
field called trypsin-res in the mytrpstest .mdb database. 
f. Close the Molecular Database Calculator. 
Scroll right, to the end of the database, to see the trypsin-res column. For the purposes of this 
exercise, let us suppose that a deviation of 1 pK; unit or more from the experimental value is a 


measure of a poor fit. Examination of the residuals for the test molecules indicates that, for the 
molecules with index 77 and 86, the fit is inadequate. 
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VI. Pruning the Descriptors 


It is sometimes necessary to prune a set of descriptors in order to select the optimum set for the 
molecules under consideration. 


1. 


Return to the database which contains the descriptor values for the training set of molecules: 
DBV | File | Open | mytrypstrain.mdb 

Alternatively, you can use the DBV menu in the top right of the Database Viewer menu. 
We will now use QuaSAR-Contingency, a statistical application in MOE designed to assist you in 
the selection of descriptors. It will suggest a set of descriptors that best describes the 


molecules in the training set. 


DBV | Compute | Descriptors | Contingency 





s/f QuaSAR-Contingency -Ioj x] 
Database: s:/mytrpstrain.mdb 
Selected Entries Only 
Activity Field: pki-trypsin Y 


Descriptors: index SMR_VSA3 À 
pki-trypsin SMR_VSA4 





SlogP VSAG SMR_VS5A5 
SlogP VSAI SMR_VS5A6 
SlogP VSA2 SMR_VSA7 
SlogP VS5A3 $PRED 
SlogP VSA4 SRES 
SlogP VSA5 $2Z-SCORE 
$XPRED 
$KRES 
SlogP VSA $XZ-SCORE 


SlogP_VSAS 
SMR_VSA® 
SMR_VSA1 


SMR_VSA2 





a. As Activity Field, select pki-trypsin. 
b. Select all the descriptors, from SlogP VSAO to SMR_VSA7. 
c. Press OK to execute the contingency analysis. The results appear in an SVL Text Editor. 


Take some time to peruse the information given in this report. Recommended ranges for 
the indices appear at the top. These ranges help you to determine which descriptors are 
good and which are not. The results for each descriptor field are summarized on a 
separate line. The columns are the descriptor number, the contingency coefficient C, 
Cramer's V, the uncertainty coefficient U and the correlation coefficient R°. 


Suggested descriptors for QSAR can be found at the bottom of the report. You may 
obtain a better model using QuaSAR-Model with these suggested descriptors. 
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3. After performing the QSAR fit, it is recommended to analyze the results using Principal 
Components Analysis (PCA). The purpose of PCA is to reduce the dimensionality of set of 
molecular descriptors by linearly transforming the data. PCA is capable of calculating a new, 
often smaller, table of descriptors that are uncorrelated and normalized (mean 0 and variance 


1). 


Returning to mytrpstrain.mdb, choose: 


DBV | Compute | Descriptors | Principal Components 


d. 








af Principal Components Analysis = | O | x| 
Database: s:/mytrpstrain.mdb 
Use Selected Entries Only 
s” Autoscale Fields 
Weight Field: (NONE) 7 
Fields: index SlogP VSAB $PRED À 
pki-trypsin SlogP_VSA9 {RES | 
SlogP VSAG SMR VSAB $7-SCORE 
SlogP VSAI SMR_VSAT $XPRED 
SlogP VSA2 SAR _VSA2 $XRES 
SlogP VSA3 SMR_VSA3 $xX7- SCORE 
SlogP VS5A4 SAR_VSA4 
SlogP VS5A5 SMR_VS5A5 
SlogP_VSA6 SMR_VSA6 | 
SlogP VSA? SAR _VSA? ¥ 
ai e 
Prefix: Pcal 
Component Limit 8 Minimum Variance (%): 166 
Condition Limit 126+986 
OK Report Cancel 





Select the molecular descriptors SLogP_VSAO to SMR_VSA7. 


b. Set the Minimum Variance to 98. This is the minimum percentage of variance to retain 


in the final set of principal components. In other words, MOE will retain the number of 
principal components required to account for 98% of the variance. 


Leaving the Component Limit at its default value of zero means that there will 
be no limit to the number of components to retain. 


Press Report. The report appears in an SVL Text Editor. Save the report as pca.rep 
using File | Save As in the Text Editor. 


The columns in the table are: 
#PC Principal component number. 


Deviation Square root of the eigenvalue of the covariance matrix corresponding to the 
principal component, i.e. the standard deviation of the data along the 
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principal component vector. 


Condition Condition number of the covariance matrix if the principal component list 
were terminated at that row. 


%Var Percentage of the variance retained if the component list were truncated at 
that row. 


From the report, 12 principal components (PCA) are required to capture 98% of 
the variance in the data set. The actual mathematical form of each of the 
principal components is also given in the report. 


d. Press OK to perform the principal components analysis. 
You can obtain a graphical display of the experimental activity of the inhibitors (oki-trypsin) 
with the first three principal components: 


DBV | Compute | Analysis | 3D Plot 
a. Set the X Axis to PCA1, Y to PCA2 and Z to PCA3. 


b. Toggle off Decorrelate Axes 
c. Set the Activity to pki-trypsin. 


Label: Element MM Charge —| = 
x Name Residue Chain [eo ]— a 


cor ih Aa 


EM EER eee eee 


© Element B-Factor Residue Chain Tag 





d. Press Plot. The 3D scatter plot in the MOE Window is a visual representation of the 
molecules as described by the three selected Principal Components (PCA1, PCA2, 
PCA3). Each point in the MOE Window corresponds to a molecule and is colored 
according to the molecule's pK; value. Leave the 3D Plot window open. 

e. Render atoms in ball-and-line with MOE | Render | Atoms 
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f. As long as the 3D Plot window has been left open, selecting the points in the main 
MOE window will select the entry it represents in the Database Viewer. Additionally, 
clicking Browse in the 3D Plot panel will launch the Database Browser showing a 2D 
depiction of the currently selected molecule. Also notice that labelling the activity 
value is written into the plot atom's partial charge, which can be shown as a label from 
the MOE | Render | Atoms panel. 
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Pharmacophore Searching 


Aim: Introduction to pharmacophore generation and pharmacophore search 


A pharmacophore is a set of structural features in a ligand that are directly related to the ligand's 


recognition at a receptor site and its biological activity. 





Show: Default All None Query 
Memory: <none>| 


Acc > —— 





e.8 O 
0.6 O 





Don > —— 
E Ani: Anionic atom 
H Cat: Cationic atom 
a Don: H-bond donor 
E Don2: H-bond donor projection 

: H-bond acceptor 

il Acc2: H-bond acceptor projection 

Aro: Aromatic center 
Aro2: Aromatic ring normal 


A 
i 
K Hyd: Hydrophobic centroid 
E 
7 


SS SISI 
A 
a 


HydA: Hydrophobic atom 
Link: Link source 
mi Link2: Link projection (conjugated) 
H Link3: Link projection 
g V: Volume constraint 
J Projecting Vectors 


Entire Receptor Surface 








We will first align three active molecules, create a pharmacophore based on common superimposed 
features and then we will perform virtual screening to identify potential hits in a chemical database 


To generate 3D coordinates in the database inhouse.mdb: 
MOE | File | Open database.mdb 


MOE DBV | Compute | Energy Minimize 





To generate a set a pool of 3D conformers as in database.mdb: 


MOE | File | Open database.mdb 


Import Conformations set up the conformer generation and the 


database output name.mdb). 





1. 3D Alignment of ligands: 


MOE | File | Open ligands.mdb 

In the MOE database viewer, select the most three active compounds (Ctrl and click left) and 
right click one of these and click Copy Selected to MOE in the popup menu. Close the Database 
Viewer. 


MOE | Compute | Conformations | Flexible Alignment 
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In the Flezible Alignment window, lower the iteration limit to 20 and 














eles AA aa To Se DO. Oe Telecine Ee ie ge IA opr NAE Scenic 


Calculation Dy press img Ok. 





The calculation lasts for several minutes. A new database named flexalgn.mdb is created. It 
includes the possible alignments. 

Open ligands_ali.mdb database and compare the overlay of compounds 1, 2 and 3 in the 
different MOE-alignment solutions. Copy one alignment in MOE main GUI and close the all 
Database Viewer. 


2. Pharmacophore query generation 


MOE | New | Pharmacophore Query 


In the Pharmacophore query editor window, give a title to the Query (e.g. PPC413). It is 
possible to create the query either manually or automatically. Choose the second option by 
clicking on Consensus.... 

In the Pharmacophore Consensus windows, select MOE as input, then click on calculate. A set 
of pharmacophore features is displayed into the MOE main GUI. Select three to five key 
features and click on Load Selected. Close the window. 

In the Pharmacophore query editor window, save your query as PPC413.ph4 


Close the window. 
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3. Pharmacophore search 











MOE DBV | File | Pharmacophore search 








“Inthe Query section of the Pharmacophoric Search window, 
Browse... the PPC413.ph4 query. Click on Search to start the calculation. 


Report results in Table . 


Flexible alignment 


Aim: Look which the ligands are similar by determining which features they have 
in common 


To perform a good alignment: 


The strain energy of each molecule is small 


SoS 


Molecules have a similar shape 


Molecules have similar LogP (octanol/water) values 


a 


Molecules have comparable molar refractivities 


P 


Aromatic atoms overlap 


at 


Hydrogen bond donors and acceptors overlap 


Acidic groups and basic groups overlap 


ies 


Atoms of similar partial charge overlap 


ha © 


Hydrophobic areas overlap 


Examples: 


Oo O O o O oO 
NS AA NZ O O O 
S, A LA Ss” CH N AA 
N N SEO 3 SS FPF 
H H H H 3 
H2C CI 
H3C 


Tolazamide Chlorpropamide Tolbutamide 
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Cl NO, 
Cl NO2 
H,CH,COOC COOCH, H3C00C a COOCH; H,COOC COOCH,CH, 
H;C” ~N~ ~CH HC N CH; H;C” “N” ~CH 
3 H 3 H 3 H 3 


Felodipine Nifedipine Nitrendipine 





How to apply: 


4. Draw the three drugs using MOE Builder 
5. Compute >>> energy minimize 


6. MOE | Compute | Conformations | Flexible Alignment — mark on Refine the existing alignment 





=O) x| 


ef Flexible Alignment 


Alignment Mode: | Flexible Rigid Body | [ Refine Existing Alignment 
Output Database: | flexalgqn.mdb Browse... | 


[m Open Database Wiewer 


teration Limit: | 20 ¥ Failure Limit: | #0 Y Energy Cutoff: 1d F 


Options: [m Calculate Forcefield Charges Prior to Search 
[m Stochastic Conformational Search 


g 





7. Analyze the output by defining the similar features essential for drug activity 
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Molecular properties calculations 


Aim: study of physicochemical properties of drugs and figuring out which 
parameters are good for activity 


1. Partition coefficient (LogP values): Log of the octanol/water partition 
coefficient 
LogP is one criterion used in medicinal chemistry to assess the druglikeness of a 


given molecule, and used to calculate Lipophilic efficiency, a function of potency 
and LogP that evaluate the quality of research compounds. For a given compound 
lipophilic efficiency is defined as the pIC50 (or pEC50) of interest minus the 
LogP of the compound. 


Lipophilic Efficiency = pIC50 (or pEC50) — LogP 


- The calculated value cLogP is often used instead of the measured LogP. 

- LiPE is used to compare compounds of different potencies (pICs5 9s) and 
lipophilicities (LogP). High potency (high value of pIC50) is a desirable 
attribute in drug candidates, as it reduces the risk of non-specific, off-target 
pharmacology at a given concentration. When associated with low clearance, 
high potency also allows for low total dose, which lowers the risk of 
idiosyncratic drug reaction. 


2. Drug solubility; LogS 


Solubility, the phenomenon of dissolution of solute in solvent to give a 
homogenous system, is one of the important parameters to achieve desired 
concentration of drug in systemic circulation for desired (anticipated) 
pharmacological response. Low aqueous solubility is the major problem 
encountered with formulation development of new chemical entities as well as for 
the generic development. More than 40% NCEs (new chemical entities) 


developed in pharmaceutical industry are practically insoluble in water. Solubility 
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is a major challenge for formulation scientist. Any drug to be absorbed must be 


present in the form of solution at the site of absorption. 
3. Acid dissociation constant; pKa 


pKa is the negative logarithm (base-10) of the acid dissociation constant (Ka) of a 


solution. pKa = -logKa 
The lower the pKa value, the stronger the acid. 


- For example, the pKa of acetic acid is 4.8, while the pKa of lactic acid is 3.8. 
Using the pKa values, one can see lactic acid is a stronger acid than acetic 


acid. 


- lonizable groups indicates acid/base strength like OH phenolic, COOH, 


H 
2 US 
apa 


Sa 


4. Lipinski's rule of 5 = Pfizer's rule of five = rule of 5: 


Amines. 





H o“ ` 


Used to evaluate druglikeness or determine 1f a chemical compound with a certain 
pharmacological or biological activity has properties that would make it a likely 
orally active drug in humans and a drug's pharmacokinetics in the human body, 
including their absorption, distribution, metabolism, and excretion (“ADME"). 


However, the rule does not predict if a compound is pharmacologically active. 
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Lipinski's rule states that, in general, an orally active drug has no 
more than one violation of the following criteria: 


1. Not more than 5 hydrogen bond donors (nitrogen or oxygen atoms with one or 
more hydrogen atoms) 

2. Not more than 10 hydrogen bond acceptors (nitrogen or oxygen atoms) 

3. A molecular mass less than 500 daltons 


4. An octanol-water partition coefficient log P not greater than 5 


QSAR and molecular descriptors 


ambo o o 


Substituent constant 
a 


Electronic descriptos | 
Hammett constants OA constants 
E parameter ___f 


Steric descriptors | 
Taft's steric parameter AA 





1. Hydrophobic parameters 
Clearly absorption and distribution processes in biological systems are determined by the 


hydrophilic or hydrophobic properties of molecules, for which the partition coefficient, P, of 
a molecule is used. P is defined by: 
— [drag] tana 


ame. 
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By using a logarithmic relationship, P becomes an additive property: 
logP = log [drug ican — 10E [AME banter 
The contribution of a substituent, X, to the logP of a molecule is defined by 
My =108 Fipetinvea 7198 Farre 
compo compo 


= og Fax 
Fru 


m is the hydrophobic parameter for a specific substituent. Extensive measurements and use 
of theoretical correlations have resulted in a large number of tabulated logP andn values. 
Typical values for T for different substituents are: 


substituent Si sa m0 
I «=e E AA 
UE 98 OMT 
au ii iii Or: 
The usefulness of 1 values is to estimate logP of a substituted compound, given the logP of a 
parent molecule. For example, 
LogP pheno! = 1OgPbenzene + Mou =2.13 + (-0.67) = 1.46 
[drug loam _ 2, 884 
TE aa 100 
LogPiotuene = logPhenzene + TcH3 = 2.13 + (0.56) = 2.69 
p — [druglame _ 48,900 





P = 





[dm S| ter 100 

logPpaBpa = logPrenzene + Tico2H + TINH? = 2.13 + (-0.32) + (-1.23) = 0.58 
_ [drug] _ 380 
[dragh 100 


Examples 


Log P values are dependent on the experimental method, as shown for the recorded values for 
phenobarbital, C12H12N203. Where a molecule possesses one or more ionizable groups, each of 
which makes a contribution to the overall partition coefficient, then logD would need to be used 
particularly at pH values far from that of the neutral species of interest. Based on correlations 
between experimentally measured and theoretical values, contributions to logP have been 
calculated for molecular fragments, enabling an estimate to be made for the logP of a 
molecule based on the structural formula. 
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B atituric acid Phenobarbital 


For example, for phenobarbital, a logP of 1.71 was estimated using the following fragmental 
constants: 


CRS | aay) Oo Ss) Cla aa s ) =(0) 106 
ea) sO) re = (Cis Caos 
KH, OF SSL barometer “= 265 
Interactions between aliphatic and aryl systems: 0.35 
Inperactions Whee Dar De Ura Bens ys irem: Beas, a 


logP = 5(0.37) -0.08 -0.22 + 0.53 + 0.91 -2.65 + 2(0.35) + 0.67 = 1.71 


Note that the -C=O and -NH- fragments of the barbituric acid portion have a combined effect on 
logP when closely associated, as in these molecules, that is different from their values in 
isolation. As a result, barbituric acid has a calculated logP of -1.41. Similarly, the starred atoms 
(*) differ from normal sp” and sp? atoms in their intramolecular interactions depending on 
whether they are in barbituric acid, or phenobarbital, where extra effects occur. 


Anticonvulsant activity of a diverse series of drugs has been shown to be related to logP by the 
equation: 


log (1/C) = 0.73logP + 2.5 


R 
H 
var aS - Hs- ; -(- Í - -N Hs- 
On “N 0 
H 


This implies that anticonvulsant activity would increase indefinitely by increasing the 
lipophilicity of the drugs. Clearly, however, there must be a limit to this increase: lipophilic 
molecules could become trapped in fat depots, thereby decreasing their availability and activity. 


Extension of the above study to include more lipophilic molecules supports this, giving overall 
a parabolic relationship. By differentiation, this gives an optimum log P value of approximately 
1.75, a value typical for most CNS-active drugs. 
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(1 .75,4.23) 


Logi1 iC) log (140) = -0.4 (ogP)* +14 logP + 3.0 


dbg LC) 
dbgP 
=o 
when  IogP=14/0.8=1.75 


= -04x2 logP +14 








log{ P} 


2. Electronic parameters 
Electronic properties were initially developed from a consideration of substituent effects in 


aromatic compounds. For example, the dissociation constants of substituted benzoic acids 
(Kx10° at 25° C) were used by Hammett 


K = 
Waza +H,0 =s CO, + H0 
R R 


a = Se ee a ee ee 





What is the effect of R on the acidity of the acids? 





acid form is stabilized if R is electron donating, therefore, the equilibrium lies to the left 


anion form is stabilized if R is electron-withdrawing, therefore, the equilibrium lies to the right 


Electronic properties can be quantified by Ka, as follows: 


log Ky pabetintedacit) ” log Ky pmatstitedacid) 7 = log K, pgi- log Kama 
Kam 
ca 


= log 
=a 
where g is the substituent constant for a given group, R, and its position, and K, are acid 


dissociation constants. It is possible to have Go, Gm and Gp values. Typical values of the 
substituent constants, &m and Gp are shown in the following table. 





o 708 TO 10337 10.067 
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gis transferable to many reactions involving benzene and other aromatic species, leading to a 
generalized form of the equation known as the Hammett equation: 


po = log E, m - log Em 
3. Steric parameters 


Taft quantified the steric (spatial) effects using the hydrolysis of esters: 


C Q 
Rr + 10 = R + CHH 
OCHs On 


Here, the size of R affects the rate of reaction by blocking nucleophilic attack by water. 


nucleophilic attack on 


nN carbonyl carbon 
e 8- 


-i — O aa Ht Q 
R R (R- 5+ + 40 == RA +  CH,0H 
OCH; OH 


increase inR 





decrease ink 


In this case, the steric effects were quantified by the Taft parameter E,: 


Es = logkgcocm - log Keu.coces 


= lo E kee OC H: 


kenc CH: 


k is the rate constant for ester hydrolysis. This expression is analogous to the Hammett 


equation. 

Compare some extreme values: H 1.24 +ve value: little steric resistance to hydrolysis 
Me 0.00 the reference substituent in the Taft equation 
t-Bu -2.78 -ve value: large resistance to hydrolysis 


Typical E, values are: 


Es Values for Various Substituents 


Me | Pr | tBu | F | C | Br | OH | SH | NO, | CH; | CN | NB, | 
oo E249 a E278 E0469 aa E055 N07 E2529 3829) o E06 
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Note: H is usually used as the reference substituent (E,°), but sometimes when another group, 
such as methyl (Me) is used as the reference, as in the chemical equation above, the value 
becomes 1.24. 

As was the case for 5 , Es may be used in other chemical reactions and to explain biological 
activities, for example the hydrolysis of inhibitors of acetylcholine esterase. 


6- 
EN ane 
= z ~OEt 
cad \ 


HO—serine-PROTEIN 


Organophosphates must be hydrolysed to be active and it is observed that their biological 
activity is directly related to the Taft steric parameter Es for the substituent R by the equation: 


log L/C) = 2.58E, + 7.94 


e See if you can deduce correlations for the following two sets of data. 


Esters, Ketones, and Ethers Alcohols AA 
log P 
1.27 
-0.75 
| C:H:COCH; | 110 | -0.27 | (CH3),CHOH | 0.90 | -0.36 
HCO CHs | 120 | -0.38  (CH;;COH | 0.90 7 0.07 
-0.23 
0.59 
0.29 
0.16 
0.81 


0.66 | 
| ©H:;COC:Hs | 200 | 066] o oo 
| (CH3)2CHCO2CHs | 2.20 | 105 | 


(ii) Growth Inhibition for Hamster Ovary Cells 

= R | IComM_| LogiACso | O * oo t 
KA ee KAA AI WA WA WA 
| mNO, | 969820 | OTB 
| pPSMe | 162 | -0210 | NA 
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0.079 | 0.06 | aa 
0.347 
0.431 


0.071 1149 | 066 | 23 
0.420 





e Salicylamides. Define the terms logP and TL, and briefly describe how they are obtained. The influence 
of the aromatic substituent X in a series of monosubstituted salicylamides on their ability to displace 
[°H]-spiperone (measured by ICso values in nM) leads to the equation 


i Son N salicylamide 
| 


derivative 
OCH: oan 


log(1/ICso) = 1.28 n- 0.52 n° + 1.54 
n=12 r=0.94 s=0.40 F=19.9 


From the equation and the table below, what is the best value for n, the corresponding ICs5o 
value, and best substituent, X? 


Substituent H F Cl Br | NO, CH; CH; CH; C3H3 


= 0.00 0.14 0.71 0.86 1.12 -0.28 |0.56 1.02 1.14 (1.55 


e Fluoroquinolone antibacterials. In QSAR studies of fluoroquinolone antibacterial agents, 


7 3 





ofloxacin 
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the following two eguations were used to determine the best substituents for positions 
1 and 8: 


for position 1: log (1/MIC) = -0.49 (L)? + 4.10 (Li) - 2.00 
for position 8: log (1/MIC) = -1.00 (Ba) + 3.73 (B4) + 1.3 
(a) Define the parameters L;, B4 and MIC. 
(b) From the equations 
e determine the optimal values for L, and By. Use the table of L; and B, values given below to 
select the best groups for positions 1 and 8. 


e explain how this information contributed to the design of ofloxacin. 


(c) Briefly describe the mode of action of the fluoroquinolone antibacterial agents and suggest 
how the QSAR results support this mechanism. (See T. Rosen's work) 


Table of L; and B; values 


Substituents H F Cl Br |l Me Et i-Pr t-Bu i-Pentyl 
L 2.06 |2.65 |3.52 |3.83 |4.23 |3.00 |4.11 |4.11 |4.11 |4.73 
By 1.00 |1.35 |1.80 |1.95 |2.15 |2.04 |2.97 |3.16 |2.97 |3.99 


e Phthalides from Celery. Since antiquity, celery extracts have been used in Chinese folk medicine to 
treat hypertension. More recently, investigations have also shown that celery seeds contain an oily 
liquid, 3-(S)-n-butylphthalide, which also has potential for development as an anti-epileptic agent. 


H 3 Hs CHa 4 H 3, CH 
E CHS 5 | uT NCH, 
© Oa 2 
s Fg 6 T 


j 14 
D 


6-substituted 
23-51- butylphthalide 3-/5-n-butyiphthalides 


QSAR analysis of a set of 6-substituted analogues of 3-(S)-n-butylphthalide gave the following 
Hansch equation. In the equation, EDs9 (mol kg") is the effective dose to protect mice against 
spasms induced by maximum electric shock. 
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1 
log EDsn =0.74 log P - 0.13 (log P )” + 0. 35 MRs + 2.23 





= 274 SUI SSS Is so 
(a) Define the terms log P and MR.6. How does log P differ from the parameter, 1. ? 


(b) Comment briefly on the significance of the values of n, r, s and F. How valid is this 
equation? 


(c) Determine the best combination of values for log P and MRg 
(d) From the table given below, what is the best substituent for the 6-position? 


e Comment on the nature of the substituent you have selected in relation to the overall 
physicochemical properties of the compound. 


(e) Calculate the corresponding EDso for your 6-substituted 3-(S)-n-butylphthalide. 


1 


(f) The 6-amino ( 6-NH.- ) analogue, has a log EU's of 4.40. Compare this value with your 
EDso and offer a possible explanation for any difference. 





Rs |H [OH |Cl_|Br_ | NO, | CN | CONH, | 


-logP | 3.12 | 3.12 | 3.53 | 4.24 | 2.92 | 2.39 | 2.25 | 
MRs | 0.10 | 0.29 | 0.60 | 0.80 | 0.74 | 0.63 | 0.98 | 





e Phorbol Esters. Two researchers studied the irritant properties of a series of phorbol esters. 


HO OH ROCO cor 


phorbol phorbol esters 





CH20H oe creer 


They plotted the same data for concentration, C, to produce skin irritancy in the mouse ear test 
versus log P values, but fitted the two different curves, A and B. The statistics for each curve- 
fitting are given under the plots. 
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log a ehh Mee log M: CURVE B 
11 A P 
10 A 10 
3 J 

3 5 





: ae I I 
4 B 10 12 14 logP 4 E 8 10 12 14 logP 


Curve A Statistics Curve B statistics 


n = 6; r = 0.98; s = 0.32; F = 32.39 in = 6; r= 1.00; s = 0.04; F = 1,389.47 


(a) Define log P and ti. 


(b) Decide which of the two plots gives the best fit to the data, giving reasons. From the curve 
you have chosen, what is the best value for log P and for C? 


(c) From the table below of mvalues of alkyl groups, calculate which group gives the 
best activity for phorbol esters. Log P of the parent molecule, phorbol, is -0.12. 


R ethyl(C,) butyl(C,) | hexyl(Cs) octyl(Cs) decyl(Cio) |dodecyl(C,2) tetradecyl(C,,) 


T 0.87 1.93 3.00 4.06 5.12 6.18 7.25 


(d) Give an explanation for the trend shown by the curve shapes above in terms of 
the structures and physicochemical properties of the phorbol esters. 
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Lab#5 


Molecular database and QSAR analysis 


Task: building the molecular database using software, by creation of fields and adding compounds 
in them. 

Database will be built using the down compounds. QSAR study of substituted 2-pyridiny| 
guanidines as selective urokinase-type plasminogen activator (uPA) inhibitors 


Table TI. Structure, biological activity and physicochemical descriptors of series 2 


Sa we 
gare ies 


| 


ai i 


H-N- NH, 





5.No R. BA (pM) —logBA chly C b_double E sal PMI — £ logP (ow) 
1 (E)-CH—CHCO,H 1.5 5.8239 2.0267 3 — 15.3085 1328.947 1.04 
2 (E)-CH=CHCONMH Me 1.28 5.8928 2.0267 3 — 15.1895 1889.878 0.662 
3 E)-CH—CHCO( l-morpholino) 2,1 5.6778 3.0267 3 — 15.4765 36] 3822 0.425 
4 (E)-CH=CHCONHCH>3Ph O77 6.1135 4.2909 3 — 12.4917 1392.558 2.45 
5 (E)-CH=CHCON(Me)CH3Ph 2 5.6990 4.2909 3 — 6.5011 2556.398 2.647 
6 CH,CH,CO,H 9.31 5.0311 2.3251 2 — 14.8113 638.494] 0.572 
7 OCH,;CO3H 90.2 4.0448 1.4696 2 — 13.5755 609.7509 0.18 
8 OCH-CONHCH-Fh 16.1 4.7932 3.7338 2 — 9.0434 353.2897 1.59 
9 OCH,CH-,0CH, 16.4 4.7852 1.6160 l — 11.0577 1442.224 0.345 
10 C,H; 166.3 3.7791 3.2767 l — 9.12613 939.6092 2.679 
ll OC,H; 2.13 5.6716 3.0267 l — 12.9468 484.1166 2.329 
12 OCH-C:H; 0.92 6.0362 3.3803 l — 13.5082 2463.076 2.463 
13 C=CC,H; O.77 6.1135 3.7767 1 — 14.6776 1390.561 3.425 
14 (E)-CH=CHC,H; 0.55 6.2596 3.9374 2 — 15.2539 1335.126 3.323 
15 (E)-CH=CHCHĦ 1 0.73 6.1567 4.8879 2 — 10.8807 118.8222 3.338 
16 (E)-CH>=CH (2-pyndinyl) 1.6 5.7959 3.3154 2 — 15.3939 1285.165 2.052 
17 (E)-CH=CH(4-C, H,0OMe) 0.69 6.1611 3.8481 z2 — 12.8072 1330.748 3.279 
18 E)-CH>CH[(3,4-0CH,0)C,Hy] 0.49 6.3098 3.7647 2 — 14.2263 2392.332 3.028 
19 (E)-CH=CH/(3-C,H,CO.H) 0.17 6.7695 4.098] 3 — 14.1573 2820.975 3.037 


— log BA — 0.544 (2 0.164 pchilw_tc 
— © =<. 1560C+ 0.0 69E Ss OL. 


+ 1.505(4+ 1.220) 


— log BA — 0.41 9( >= 0.099 Hog PLO, ww} 
— O=<.110(+0.045)E Sol 
+ OS Y2 SHS 0O. 139b dowble 


+ nO a = TANGI T5} 
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JJ | 1.23 | -0.67 | -0.57 | -0.28 | 0.0 | 0.14 | 0.270 | 0.38 | 0.56 | 0.71 | 0.88 | 1.18 _ 
| -0.66 | -0.37 | 0.66 | 0.78 | 0.0 | 0.06 | 0.70 | -0.24 | -0.17 | 0.23 | 0.54 | -0.90 _ 





- Calculate Lipinsk1’s rule for all. 
- Which compound is polar, and which is not? 
- Measure different molecular descriptors that could describe the compounds. 


Log 1/C = -0.31n° + 0.141 +1.29 64 4.56 
n= 10 r = 0.87 s = 0.47 


C is the EDs, concentration (moles per kilogram), in the anti-pentylenetetrazol induced seizure 
test in mice, a measure of anticonvulsant activity. 


(a) Define 1 and 5. 
(b) Comment briefly on the validity of the equation. 
(c) Determine the best values for n anda that give maximum anticonvulsant activity. 


(d) From the table below, which substituent is the best choice to give this maximum? What 
physicochemical properties does the substituent have? 


(e) If log P is 2.11 for the unsubstituted |,4-benzodiazepinone, what is the log P for your 
substituted 1,4-benzodiazepinone? 

e Salicylamides. A series of mono-substituted salicylamides was investigated to determine the 
influence of the aromatic substituent, X, on their ability to displace "H-spiperone from 
dopamine D> receptors in vitro. Analysis of the resulting data gave the following Hansch 
equation: 





OH O 
x Pen 

{ OCHs EHe 
log ICs = 1.287 - 0.5207 - 0.695, + 1.50 salicylamides 


n = 12, r” = 0.94, s = 0.40, F = 19.9 

(a) How valid is the equation? Briefly describe how this type of equation is obtained. 
(b) What are the best values for the substituent parameters, 1 and Sw? 

(c) From the table of substituents, what is the best substituent for X? 


(d) What is the ICso for the salicylamide with the substituent you have chosen? 
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(e) Briefly comment on how the properties of this substituent contribute to the activity of the 
salicylamide. 


(f) Sketch the general shape of the 3-D plot for the above equation. 
X CONH: NH; OH CN NHCH; NO, NHC,H; NMe, |NHC3H7 | NHC,Ho 
T |-1.49 |-1.23 -0.67 -0.57 |-0.47 -0.28 0.08 0.18 (0.62 1.16 


Cm 10.28 -0.16 }0.12 {0.56 |-0.30 {0.71 |-0.24 -0.15 |-0.24 -0.34 
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Assay of Sulpha Drugs (Sulphamethoxazole) 


yA 
a Pa ee as 
\,  \ 


N— O 
H-N 
Assay using diazotization method 


HCI, NaNO; titrant, Starch-lodide paper indicator 
Yellow brown to blue color 


Principle: 


Sulphamethoxazole being a primary aromatic amine will undergo reaction with sodium nitrite in acidic 
condition (nitrous acid) to form a diazonium salt. Observation of end point depends on determination 
of small excess of nitrous acid which is determined by using the starch-lodide paper as an external 

indicator. The iodine liberated reacts with starch to form a blue color. 








CH CH, 
f / ) 
ai — 
— ATA 
| ) | ` 
| 0 | o 
Paie AA 
HN N HN N 
oO=s=—0 O—s—O 
+ NaNO; + HCl ä > | + NaCl + MO 
ae ae we —— 
f e Sodium nitrite | D 
i a a — 
NH; NOC] 
Sulphamethoxazole Diazonium salt 
Starch 


HNO, + 2 Na | ————3> |], —— y Blue color 


Diazotization titrations method: 


Weigh accurately about 0.5 g of Sulphamethoxazole sample and dissolve in 50 ml of 2M hydrochloric 
acid. Cool it in ice bath and add 2 gm of potassium iodide (KI). Dissolve and titrate slowly with 0.1 M 
sodium nitrite solution. The temperature should never go above 152C. continue titration until yellow 
brown color appear. The end point is determined by placing one small drop of solution is drawn from 
the conical flask with the help of glass rod on to a starch iodide paper. Appearance of blue color is the 

end point. 


Calculation: Each ml of 0.1 M NaNO, equivalent to 0.02528 gm of Ci 9H,,N303S. 


Report: The given sample contains ------------- gm of Sulphamethoxazole. 
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Chemical identification 
Azodye test 

FeCl; test 

Spectroscopic identification 


HNMR Spectrum 
N 
siii N 7.61 x 
NH NH, 6.27 
ya 
6.09 a 
2.36 O 6.61 
\\ 7.61 
8 7 6 5 4 3 2 l 

PPM 

IR Spectrum 


SULFANILAMIDE, N^ 1-(5-METHYL-3-ISOXAZOLYL)- 
INFRARED SPECTRUM 


a ' 
ma i i, 


| ` 


Transmitance 





4000 3000 2000 1000 
Wavenumber (crm-1) 


NIST Chemistry WebBook (https://webbook.nist.gov/chemistry) 


I 
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Ligand-target interaction 


Predicting interactions between small molecules and proteins is a key element in 


the drug discovery process. In particular, several classes of proteins such as G- 





rotein-coupled receptors (GPCR), enzymes and ion channels represent a large 


fraction of current drug targets and important targets for new drug development. 


c. Types of ligand-target interactions: 


ee pie a ee N 
O 
| Hydrogen bonding interaction (ACCEPTOR and 
ya DONOR 
ge Banzi SS ) 
,0 
{© ®© HN Ionic interaction (SALT BRIDGE) 
“oH 
n Metal ion complex 

Hydrophobic interaction (Aromatic stacking) 
tk 





La ol x Cation--aromatic (bi) interaction 
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Structure of Mycobacterium tuberculosis CYP121 in 
complex with the antifungal drug fluconazole 


. Open the crystal structure of CYP121 complex with fluconazole in MOE 
(21J7 PDB) 

. CLICK on the right corner SEQ 

. Analyze the structure through , how many amino acids there, drugs, number of 
H20 molecules, metals, ions, or others 

. MOE/Compute / ligand interaction 

. In the moe window of interaction /isolate to look deep and extract the binding 
information: 

. Binding pocket amino acids 


2. Types of interaction 


. Fragments of ligand involved in the interaction 
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A30 
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